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Abstract. We show by a surprisingly simple argument that the ex- 
changeability condition, which is key to the exchangeable pair approach 
in Stein's method for distributional approximation, can be omitted in 
many standard settings. This is achieved by replacing the usual antisym- 
metric function by a simpler one, for which only equality in distribution 
is required. In the case of normal approximation we also slightly improve 
the constants appearing in previous results. For Poisson approximation, 
a different antisymmetric function is used, and additional error terms are 
needed if the bound is to be extended beyond the exchangeable setting. 

1. Introduction 



In the context of normal approximation, a variant of Stei n's method that 



is of ten us e d is t he exchangeable pair coupling introduced in 



and 



Stein 



Diaconid (|l977l 'l 



19861). There are n iany app l ication s based o n this coupling, see 



e.g. 

but a. 



Rinott and Rotail dlMZl) 



FulmanI |2004bl l. iMecke; 



(120051) 



s o in the context of non-normal a ppro ximation such as 



(l200ffl and others 



Chatterjee and FulmanI (j2006l ) and 



Rollin 



Chatterjee et al. 



Assume that is a ran dom variable which we want to approximate. The 



key concept introduced by ISteinI (|l986l ) is that of auxiliary randomization. 



In the context of exchangeable pairs, this means that we construct a random 
variable W on the same probability space such that {W, W') is exchange- 
able, that is ^{W,W') = J^'{W',W) and, in general, the pair should be 
constructed such that \W' — W\ is small. One can then prove for instance 
a bound as in Theorem 12.11 below (often under additional conditions on the 
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exchangeable pair such as Equation (12. 1|) ). If W and W are two consec 



utive steps of a reversible Markov chain in equilibrium, 



Rinott and Rotar 



(jl997l ) note that exchangeability auto maticallv follows 



constru cted in some of the examples of 



Rinott and Rotar 



iowev er, t 



le pairs 



19971 ) and 



Fulman 



|2004al ) are based on non-reversible Markov chains and therefore some ef- 
fort has to be put into showing that the pairs satisfy the exchangeability 
condition. 

The key fact to prove a bound as in Theorem l2.1l is that, for any antisym- 
metric function F, exchangeability of {W, W') implies the identity 



fl.ll 



]SF{W, W) = 



see (ISteinl . 



19861 . p. 10)). In fact, this is often the only place where ex- 
changeability is used. For example, in the case of the normal distribution, 
the standard choice for F is 



(1.2) 



F{w,w') = {w' -w){f{w') + fiw)), 



where / is the solution to the Stein equation 



(1.3) 



f'{x) - xf{x) = h{x) - E/i(Z) 



and Z has the standard normal distribution. We show in this paper that 
instead of the choice (11.21) we can take the simpler function 



(1.4) 



F{w,w') = / f{x)dx- / f{x)dx, 
Jo Jo 



which is, in particular, antisymmetric, but for which (11.11) of course holds 
without exchangeability as long as ^{W) = ££{W). If exchangeable pairs 
are used in a discrete setting, the integrals in p.4|) of course have to be 
replaced by corresponding sums. 

In the following section we restate and prove results obtained by the ex- 
changeable pairs approach for normal and Poisson approximation. The main 
purpose is to demonstrate in detail how the exchangeability condition can be 
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omitted in some standard settings, sometimes also yielding better constants. 
With the approach in this paper, however, it is possible t o rem o ve th e ex- 
changeabilitv c ondition in manv other s i tuatio ns, su ch as in 



Meckef 



mm 



Chatterjee and FulmanI |2006l ). and 



Rolhn 



SteinI (Il995|) and 



20071). 



Most ingredients in the following proofs are taken directly from the proofs 
of the corresponding papers; hence many details have been omitted. 



2. Main results 

2.1. Normal approximation. For normal approximation we need some 
more assumptions. Assume that is a random variable with JEW = and 
YarW = 1. Construct an exchangeable pair such that 

(2.1) IS^W' = {I- X)W + R 

for some constant < A < 1 and some random variable R, where denotes 
the conditional expectation with respect to W. 

The following calcula tions are essential for the proof of Theorem 1.2 of 



Rinott and Rotan (|l997l ) , but we now assume for the sake of clarity that R = 
0. Let / be the solution to (ll.3|l for a Lipschitz-continuous test function h. 
Now, with the standard choice of F as in (II. 2|) and exchangeability, we have 

= EF(W, W') 
= ]S{iW' -W){fiW') + f{W))} 

(2.2) 

= lE{iW' - W){2fiW) + fiW) - f{W))} 

= -2X]S{Wf{W)} +'E{{W' -W){f{W') - f{W))}, 

where for the last equality we used (12.11) . Let now r be a random variable 
uniformly distributed on [0,1], independent of all other random variables, 
and put V = W — W. Noting that the second derivative /" exists almost 
everywhere because h is Lipschitz continuous, Taylor's expansion yields 

f(W') = f{W) + Vf'iW) + V^-E^'^'{{1 - T)f"{W + tV)}. 
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Thus, from (EH 



(2.3) lE{WfiW)} = ^E{y2/'(VF)} + _ r)f{W + rV)}. 

In the proof of Theorem 12. II we show that an equation similar to (|2.3I) can 
be deduced without exchangeabihty. To state the theorem, we need some 
notation. Define for a given function h and e > 0, 

h^{x) = sup{/i(x + y) : \y\ ^ e}, h'^ {x) = inf{/i(x + y) : \y\ ^ e}. 



Let W be a class of measurable functions on the real line such that for all 
h Gfi, we have \\h\\ ^ 1, where ||-|| denotes the supremum norm; for any real 
numbers c and d, h £ Ti implies h{c ■ +d) S TC; for any e > 0, h G Ti. implies 
hf,hj £ TC and there is a constant a (depending only on the class TC) such 
that ] Ei[ht(Z) — hjjZ)} ^ ae where Z has standard normal distribution. 



As in 



Rinott and Rotail (jl997l ) we assume without loss of generality that 



TT. 



Theorem 2.1 (cf. Theorem 1.2 of iRinott and Rotan (119971 )). Assume that 
W and W are random variables on the same probability space such that 
^{W') = ^{W), Ely = 0, VarTy = 1. Assume that ([211]) holds for some 
A and R. Then, for 6 := suphenl^HW) - E/i(Z)|, 



6 



(2.4) S^j^Ya.r-EW(W'-Wy+ ^^ + A ' 

//, in addition, there is a constant A such that \ W' — W\^A almost surely, 
we have 



(2.5) .6 ^ l^JY^rlSW{W'-Wy + 37vW ^ ^ ^A? 

A V A A y A 



Proof. Prom Lemma 4.1 of 
< t < 1, 



Rinott and Rotan (jl997l ) we have that, for any 



(2.6) 



<5 ^ 2.8 sup|E/it(VF) - E,ht{Z)\ + 4.7at, 
hen 
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where ht{x) = E/i(x + tZ). Let / be the solution to the Stein equation 



(2.7) 



Then, / satisfies 



(2i 



f'ix)-xf{x) = htix)-]EhtiZ). 



^2.6, ||/'|K4, ||/"|K2||/i^|K i.et-i; 



recalhng that \\h\\ ^ 1, the first two bo unds and the first 



one follow from Lemma 3 of 



SteinI |l986l ) and, as noted by 



part of the third 



Rinott and Rotar 



(|l997l ). the second inequality of the third bound can be deduced using the 



equality = —t ^ j h{x + ty)ip' {y)dy , where (p is the standard normal 



density, so that \\h[\\ ^ t ^ J \ip'{x)\dx = t ^ ^Ijix . 
Define the function 



(2.9) 



G{w) 



f(x) dx 



and note that |G(i(;)| ^ lif^lll/ll, so that lEG(VF) exists. By Taylor's expan- 
sion, we have 



(2.10) 



G{W') = G{W) + Vf{W) + \V^f[W) 



where, again, V = W — W . Thus, together with (|2.ip . we obtain 

= EG(VF') - mG{W) 
= -XM{Wf{W)] + M{Rf{W)} 

+ \^{v^f{w)] + iE{y3(i _ Tff"{w + tV)], 

which can be rearranged to obtain the following analogue of (|2.3p 

m{wf{w)] = \^{v'^f'{w)] + _ Tff"{w + tV)] 

(2.11) 

+ E{i?/(Vr)}. 
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With a := ]E{RW} and noting that EF^ = 2(A - a) from ([211, we thus 
have from ([221) and (ITTTIl 

X{mt{W) - mt{Z)) = E{((A - a) - lv^)f'{W)} 

+ ]S{af'{W)-Rf{W)} 
-iE{y3(i_r)2/"(Ty + ry)} 
=: Jl + J2 — ^Js- 



Using (12.81) . we obtain the estimates 



1 6 

|JiK 2VVarE^y^ | J2K 6.6VrR2, IJ3K — E|y|^ 



for details see 



Rinott and Rotar 



imi ). Choosing t = 0.4(E|y|V(aA))^^^ 
and with (|2.6I1 . this proves (|2.ip . 

Assume now that |y| ^ A. Note that because of (12.711. f ^^fa;) = f{x) + 



xf'{x) + h[(x). Following the proof of 



Rinott and Rotar 



19971 ) , but recalling 



that in our remainder J3 the term (1 — r) is squared, we have 

I J3I = E{y^(l - Tf{f{W + tV) + {W + TV)f'{W + tV) + h'tiW + tV)) } 

^ 0.9^3 + 1.4E{y2(|VF| + \W'\)} + E{1/3(1 - Tfh[{W + tV)} 

^ 3.7A^ + E{y3(l - rfKiW + tV)}, 
where the latter expectation can be bounded by 

„43 A3 

|E{y3(i _ r)X{W + rV)}\^— + —{26 + aA); 



see agam 
obtain 

(2.12) 



Rinott and Rotar 



19971 ). Collecting all the bounds on the Jj we 



\mt{W) - E/it(Z)| ^ -\/Va7E^n^+ — \/RR2 

A A 

3.7^3 aA^ A^(26 + aA) 
+ + ■ 



2A 



6A 



6Xt 
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Recalling that a ^ putting (I2.12p into (|2.6p and with the choice 

t = f)22A{^^2^^f'\ we finally have 

5 ^ S.eVVarEW^ + 18.5VEi?2 + 7 + 3 + 4.2 

A ^/A 

This inequality is of the form 5 ^ a + bV6, for which we can show that 

(5 ^ 2a + 6^ and which hence proves (12.5(1 . □ 

Note that if -R = almost surely, equation (|2.11|) is the same as (|2.3|) . 
except that the factor (1 — r) is squared; this is the only reason for the 
improved constants. If (W, W) is exchangeable, both equalities (|2.3|) and 
(|2.11|) hold. At first glance this may seem to be a contradiction, but the 
remainders with the second derivatives are in fact equal. To see this, write 

]S{V^{l-Tff"iW + TV)} 

= E{y3(l - T)f"{W + TV)} -]S{V^T{l-T)f"{W + tV)}. 

We need only show that the second term on the right hand side is equal 
to zero. Note to this end the simple fact that ^{t) = ^(\ — r), thus, 
using this in the following calculations to obtain the first equality and the 
exchangeability of (W, W') for the third equality, 

E{(Pr' - WfT{\ - T)f{W + T(Ty' - W)) } 

= E{(1^' - WfT(\ - T)f"{W + (1 - t){W' - W))} 
= E{ {W' - Wfril - T)f" {W' + t{W - W')) ] 
= E{ (M/ - W'fT{l - T)f" {W + t{W' - W)) } 
= -]S{{W' - WfT{l - T)f" {W + t{W' -W))}, 

which proves the claim. 




2.2. Poisson approximation. The situation in the discrete setting, in which 
W takes values only in Z, is more delicate. Assume that / is the solution 
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to a Stein equation of a discrete distribution. Instead of (|2.9p . define the 
function G as 

W — UI— 1 



(2.13) 



fc=i 



fc=0 



where here and in what follows Ylik=a defined to be zero if 6 < a. For 
many standard distributions, / will be bounded, thus ^ |w|||/||, so 

that EG(PF) exists if E|T^| < oo. 

With such G, one verifies that G{w) — G{w — 1) = f{w) for all w £ 
Define h = I[W' -W = i] and /\iG{w) := G{w + i) - G{w) for all i G Z; 
then, 



(2.14) G{W') - G{W) = h^iGiW) 

and thus, if ^{W) = ^{W), 
(2.15) 

= ^G{W')-'^G{W) = ^E{/iAiG(VF)} = ^ E{i^,(VF)A,G(H^)} 



where Pi{W) := F^[W' -W = i]. 
As mentioned in the introduction, 



Rinott and Rotail (jl997l ) suggest con 



structing W and W via an underlying stationary Markov process, which 
implies ^{W) = ^{W); if the chain is reversible, exchangeability follows 
immediately. However, the Markov chain that they use for the anti-voter 
model is not reversible, and so exchangeability has to be proved separately. 
One way of doing this is to assume that 



(2.16) 



W -W e {-1,0,1}, 



al most surelv, whic h is th e main assumption in iRollinI (|2007l ). 

Chatterjee et al.l |2005l ) applied the method of exchangeable pairs to the 
Poisson distribution. Assume that (1^',M^) is an exchangeable pair taking 
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values only on the non-negative integers. Define the antisymmetric function 

(2.17) F{w,w') = f{W')I[W' -W = 1]- f{W)I[W' -W = -1]. 

This yields 

(2.18) 

= EF(W, W) 

= ]S{f{W')I[W' -W = 1]- f{W)I[W' -W = -1]} 

= ]S{]S^{f{W')I[W' -W = l]}- ]S^{f{W)I[W' -W = -1]}} 

= E{/(VF + l)¥[W' -W = l\W] - f{W)¥[W' -W = -l\W]] 

= ^{f{W + l)Pi{W) - f{W)P-i{W)]. 

We can use n ow the standard argum ent for Poisson approximation by Stein's 



method (see (Barbour et al 



1991 p. 6)). Denoting by Po(A) the Poisson 



distribution with mean A, it follows that 



(2.19) 



dT\{^{W),Y'o{\)) ^ snv\E.{\ f {W + I) -Wf{W)]\ 

^ sup|E{(cPi(W^) - \)f{W + 1) - (cP_i(VF) - W)f{W)]\ =: 
/ 



for any c > and where the supremum ranges over all solutions f = /a 
the Stein-equation 



(2.20) 



A/(j + 1) - jfU) = I[j ^A]- Po{X){A}, 



for subse ts A of the non-negativ e integers. For many applications investi- 



gated by 



Chatterjee et al 



(j2005l ). the following further bound on Kc is used: 



^ A~^/2(E|cPi(PF) - A| + ]S\cP-i{W) - W\), 



which results from the well known bound ||/|| ^ A (see ( Barbour et al 
I992I . Lemma 1.1.1)). However, it is at times beneficial to work directly with 



the expression in (|2.19p in the hope of re-arranging things so that the better 
bound IIAi/ll ^ (1 -e-^)A-i may be applied. 
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It may seem surprising that, although in the bound (|2.19|) only the jump 
probabilities of size 1 appear, no assumptions concerning the sizes of other 
jumps were made in the above calculations; we did not, for instance, assume 
condition (I2.16p . However, with the choice f{w) = I[w = k] for k £ 1j, 
we obtain from (|2.17l) the detailed balance equation for reversible Markov 
chains. Even if the chain makes jumps of size larger than 1, the stationary 
distribution is determined by the jump probabilities of size 1, so that in (|2.19ll 
the full information about the distribution under consideration is actually 
used, just by starting from (|2.17|1 . If exchangeability is not assumed, the 
effects of jumps of size larger that 1 have also to enter, and this is reflected 
in the bounds of the next theorem. The choice of F in (|2.17ll is fundamen- 
tally different from the standard choice in the continuous setting, where we 
obtained the same bounds as before, but under weaker assumptions. 



Theorem 2.2 (cf. Proposition 3 of lChatterjee et al.l (120051)). Let W and W 



be non-negative random variables such that ^(W') = ^{W). Then, for any 
constant c> 0, 

(2.21) dTV {^{W) , Po(A)) (^Kc + cp 
where p satisfies the bounds 

(2.22) p ^ ^ i Y,\Pk,k+i - Pk+i,k\ ^ \i\]SPiiW), 
for pkj = P[W = k, W' = j] and the Pi are as before. 

Proof. Taking expectation over (|2.20p with respect to W , using (|2.15|1 and 
noting that AiG{W) = f{W + 1) and /\-iG{W) = -f{W), we obtain 

E{A/(VF + 1) - Wf{W)] 

= E{(A - cPi{W))f{W + 1)-{W- cP.i{W))f{W)] 
+ c^E{/iAiG(Pr)+/_,A_iG(ty)}, 

i>2 
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where, as before, = I[W' — W = i]. Now, it is easy to see that 
(2.23) E{l,AiG{W) + UA^iG{W)} = Y,{vk,k+i - Pk+i,k)AiG{k) 

Recalhng the bound ||/|| ^ A^^/^ for all solutions / of (|2.20l) . and hence 
|AiG(l^)| ^ |i|A-i/2 for all i G Z, yields the first bound of (IT^ . 

Now, as Pk,k+i = ^[W = k]Pi{k), the second bound follows from the first. 

□ 

Under condition (12.161) , Theorem 12.21 and estimate (I2.19P clearly yield the 
same bound. If (|2.16|) does not hold, exchangeability is not automatically im- 
plied. Then the first bound of (|2.22|) is a measure of the non-exchangeability 
of (VF,VK'); if the pair is in fact exchangeable, this term vanishes and we 
regain (|2.16|) . The second bound in (I2.22p is particularly useful if the jump 
probabilities of larger jumps are small. 



3. Discussion 



The key to understand the 



introduced in 



Barbouil (119881 ) . Recall that 



present approach is the generator interpretation 



iAG){x) = G"{x) - xG'{x) = f'{x) - xf{x) 



is the generator of the Ornstein-Uhlenbeck diffusion, applied to the func- 
tion G. Now assume that a Markov chain {X{n);n G Z+} with station- 
ary distribution ^{W) is given and let (VF,M^') = (X(0),X(1)), assuming 
that the chain starts in its equilibrium. Construct a Markov jump process 
Z = {Z{t);t ^ 0} by randomising the fixed steps of size 1 of the Markov 
chain X and wait instead an exponentially distributed amount of time with 
rate 1/A. It is easy to see that, under the assumption of (|2.1|) . the infinites- 
imal operator of Z is 

E[G(Z(/i)) I Z(0) = x]- G{x) _ WG{W') - G{x) 



(^G)(x) = lim 



h 



A 
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-xG'ix) + -^G"{x) 



w{{v - tfi[v > t > or y < t < 0]} 



2A 



G'-^\x + t)dt 



where W denotes E{- = x} and V = W'-W. Thus, we in fact compare 
the infinitesimal operator of this jump process with the generator of the 
Ornstein-Uhlenbeck diffusion. 

The antisymmetric function F{w,w') = {w' — w){f{w') + f{w)) in fact 
also (almost) calculates this infinitesimal operator, but with the first step of 



In the case of 



Stein 


( 1995 


) and 


Meckes 


(2nn6) 



jump processes {Zm',rn G N} with infinitesimal operators 

WG{Wm) - G{x) 



{BmG){x) 



where e^^A ^ are the jump rates and by letting e — > we show that the 
processes Zm converge to a diffusion process with infinitesimal operator 

{BG){x) = (1 + iA-iE^'E)G"(x) - xG'ix) 

for some specific random variable E. This diffusion has the same linear 
drift as the Ornstein-Uhlenbeck diffusion, but a non-constant diffusion rate. 
Stei n's method now allows us to state through the approaches in 



Steinl (119951) 



and iMeckesI (|20Q6l ) that the less lE^E fluctuates around zero, the nearer the 
stationary distribution of the process is to the stationary distribution of 
Ornstein-Uhlenbeck diffusion, that is, to the standard normal distribution. 
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